Getting Started with the HLO Feature Dataset
- Introduction: Learn how to use the dataset for predicting deep learning training times using metadata and graph features.
- Prerequisites:
- Python 3.x
pandas
, numpy
, xgboost
- Steps:
- Download the dataset from Hugging Face: HLO Feature Dataset Repo
- Open the baseline notebook:
Baseline_XGBoost_Resource_Estimation.ipynb
- Follow step-by-step instructions to train your first resource estimation model.
- Outcome: A trained XGBoost model capable of predicting
fit_time
for DL workloads.